feat: Revamp wait for animation to end tool + tiny fixes #92

plfavreau · 2025-10-17T10:00:43Z

🚀 What's new?

Revamping the wait for animation to end tool with a somple wait for dealy (sleep).
Tiny fixes to improve mobile-use execution.

🤔 Type of Change

What kind of change is this? Mark with an x

Bug fix (non-breaking change that solves an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update to the docs)

✅ Checklist

Before you submit, please make sure you've done the following. If you have any questions, we're here to help!

I have read the Contributing Guide.
My code follows the project's style guidelines (ruff check . and ruff format . pass).
I have added necessary documentation (if applicable).

💬 Any questions or comments?

Have a question or need some help? Join us on Discord!

Summary by CodeRabbit

Removed Features
- Screenshots are no longer included in agent context or message composition.
- Animation-detection wait behavior removed.
New Features
- Added a millisecond-configurable delay operation to pause execution.
Breaking Changes
- Animation-based waiting replaced by a fixed delay approach.
- Agent state no longer stores latest screenshot data.

…unction using time.sleep

…it_for_delay tool

…ait_for_delay tool

…te and context

coderabbitai · 2025-10-17T10:01:07Z

Walkthrough

Removed tracking and use of screenshot Base64 across state, agents, and SDK; replaced the animation-wait mechanism with a millisecond delay API and added a wait-for-delay tool and wrapper.

Changes

Cohort / File(s)	Summary
Screenshot state removal `minitap/mobile_use/graph/state.py`, `minitap/mobile_use/sdk/agent.py`	Removed the `latest_screenshot_base64` field from the shared State and its initialization in graph state construction.
Screenshot context removal `minitap/mobile_use/agents/contextor/contextor.py`, `minitap/mobile_use/agents/cortex/cortex.py`	Removed imports and logic that conditionally added screenshot Base64 to context and LLM messages; removed corresponding state update keys and logging tied to screenshots.
Wait mechanism refactor (controller) `minitap/mobile_use/controllers/mobile_command_controller.py`	Removed `WaitTimeout` enum and `wait_for_animation_to_end`; added `wait_for_delay(time_in_ms: int)`; replaced enum-based timeout usage with fixed 500ms in one path; added `time` import and removed `latest_screenshot_base64` from dummy state.
Tool wrapper swap (index) `minitap/mobile_use/tools/index.py`	Replaced import/reference of `wait_for_animation_to_end_wrapper` with `wait_for_delay_wrapper` in `EXECUTOR_WRAPPERS_TOOLS`.
Wait tool addition `minitap/mobile_use/tools/mobile/wait_for_delay.py`	Added `get_wait_for_delay_tool` exposing `wait_for_delay` tool and `wait_for_delay_wrapper`; validates/clamps `time_in_ms`, delegates to controller in a thread, records ToolMessage and agent thoughts in executor state, and provides success/failure messages.
Config merge validation `minitap/mobile_use/config.py`	Reworked deep_merge_llm_config to validate override keys, warn on unknown keys, and only recurse when both values are dicts.

Sequence Diagram(s)

sequenceDiagram
  participant Agent as Agent / LLM
  participant Wrapper as wait_for_delay_wrapper
  participant Tool as wait_for_delay (tool)
  participant Controller as wait_for_delay_controller
  participant Graph as Graph State

  Note over Wrapper,Tool: wait_for_delay tool flow
  Agent->>Wrapper: request wait_for_delay(time_in_ms)
  Wrapper->>Tool: invoke wait_for_delay(tool_call_id, state, agent_thought, time_in_ms)
  Tool->>Controller: run wait_for_delay_controller(time_in_ms) (async/thread)
  alt success
    Controller-->>Tool: success
    Tool->>Graph: update EXECUTOR_MESSAGES_KEY and agent_thoughts (success)
    Tool-->>Wrapper: success result
    Wrapper-->>Agent: "Successfully waited for X milliseconds."
  else failure
    Controller-->>Tool: error
    Tool->>Graph: update EXECUTOR_MESSAGES_KEY and agent_thoughts (failure)
    Tool-->>Wrapper: failure result
    Wrapper-->>Agent: "Failed to wait for delay."
  end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

feat: screen analyzer agent - cortex reasoning agent does not need to be a VLM anymore #85 — touches cortex and screenshot handling; strongly related to removal of screenshot logic.
feat: adding adb direct commands for core actions #87 — modifies controller wait/run flow and likely overlaps the wait-for-animation → delay changes.

Suggested reviewers

KYN4
cguiguet

Poem

"I'm a rabbit in the CI lane, small and spry,
Screenshots tucked away — I wave them by.
Milliseconds counted, I hop and wait,
A tiny delay makes workflows great. 🐇✨"

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 25.00% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title Check	✅ Passed	The pull request title "feat: Revamp wait for animation to end tool + tiny fixes" directly aligns with the primary change in the changeset, which is replacing the wait-for-animation-to-end tool with a simpler wait-for-delay implementation. The title is concise, specific, and clearly indicates that the wait-for-animation-to-end tool is being revamped rather than removed, which matches the actual refactoring that occurs across multiple files. The phrase "tiny fixes" appropriately captures the secondary changes including the removal of the latest_screenshot_base64 field and configuration updates, setting proper expectations for the scope of the PR.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feat/revamp-wait-for-animation-to-end-tool

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

minitap/mobile_use/tools/mobile/wait_for_delay.py (1)
40-42: Add upper bound validation and document the default value.

The validation sets negative values to 1000ms, but this isn't mentioned in the docstring. Additionally, there's no upper bound check here (though this should ideally be enforced in the controller at lines 558-561 of minitap/mobile_use/controllers/mobile_command_controller.py).

Apply this diff to add an upper bound and improve clarity:
+        # Validate and clamp time_in_ms to reasonable bounds
         if time_in_ms < 0:
             time_in_ms = 1000
+        elif time_in_ms > 10000:
+            time_in_ms = 10000
         output = wait_for_delay_controller(time_in_ms)
Also update the docstring to document this behavior:
         Args:
-            time_in_ms: The number of milliseconds to wait.
+            time_in_ms: The number of milliseconds to wait (negative values default to 1000ms, capped at 10000ms).

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b20e617 and 9289445.

📒 Files selected for processing (7)

minitap/mobile_use/agents/contextor/contextor.py (0 hunks)
minitap/mobile_use/agents/cortex/cortex.py (0 hunks)
minitap/mobile_use/controllers/mobile_command_controller.py (3 hunks)
minitap/mobile_use/graph/state.py (0 hunks)
minitap/mobile_use/sdk/agent.py (0 hunks)
minitap/mobile_use/tools/index.py (2 hunks)
minitap/mobile_use/tools/mobile/wait_for_delay.py (1 hunks)

💤 Files with no reviewable changes (4)

minitap/mobile_use/graph/state.py
minitap/mobile_use/agents/cortex/cortex.py
minitap/mobile_use/agents/contextor/contextor.py
minitap/mobile_use/sdk/agent.py

🧰 Additional context used

🧬 Code graph analysis (3)

minitap/mobile_use/tools/index.py (2)

minitap/mobile_use/controllers/mobile_command_controller.py (1)

wait_for_delay (558-561)

minitap/mobile_use/tools/mobile/wait_for_delay.py (1)

wait_for_delay (20-64)

minitap/mobile_use/tools/mobile/wait_for_delay.py (4)

minitap/mobile_use/context.py (1)

MobileUseContext (57-74)

minitap/mobile_use/controllers/mobile_command_controller.py (1)

wait_for_delay (558-561)

minitap/mobile_use/graph/state.py (2)

State (20-88)

asanitize_update (64-88)

minitap/mobile_use/tools/tool_wrapper.py (1)

ToolWrapper (9-12)

minitap/mobile_use/controllers/mobile_command_controller.py (1)

minitap/mobile_use/tools/mobile/wait_for_delay.py (1)

wait_for_delay (20-64)

🔇 Additional comments (3)

minitap/mobile_use/tools/index.py (1)

15-17: LGTM! Clean migration to the delay-based tool.

The import and reference changes correctly reflect the transition from animation-based waiting to a simple delay mechanism.

Also applies to: 32-32
minitap/mobile_use/tools/mobile/wait_for_delay.py (1)
42-48: Note: Error handling path is currently unreachable.

The controller function wait_for_delay_controller (defined at lines 558-561 in mobile_command_controller.py) always returns None on success and doesn't catch exceptions. This means has_failed will always be False, and the failure handling path (on_failure_fn) is never executed. If the sleep operation were to fail (unlikely but possible with system interrupts), it would raise an exception rather than returning an error value.

This is likely acceptable for the current implementation, but if you want robust error handling, consider wrapping the controller call in a try-except:
try:
    output = wait_for_delay_controller(time_in_ms)
    has_failed = False
except Exception as e:
    output = str(e)
    has_failed = True
Otherwise, you might simplify the code by removing the unused error handling:
output = wait_for_delay_controller(time_in_ms)
agent_outcome = wait_for_delay_wrapper.on_success_fn(time_in_ms)
tool_message = ToolMessage(
    tool_call_id=tool_call_id,
    content=agent_outcome,
    status="success",
)
minitap/mobile_use/controllers/mobile_command_controller.py (1)

564-572: I need to gather more context about the PR changes and the intent behind animation-based waiting. Let me check if there are any other relevant files or commits that explain the migration.

The review comment is based on incomplete code inspection and should be disregarded.

The original concern states the function "still appends a waitForAnimationToEnd command with a hardcoded 500ms timeout," but this is inaccurate. The function appends this command only when explicitly requested via the wait_for_animation_to_end parameter (which defaults to False).

Of the 6 call sites identified:

launch_app (line 502-504) explicitly passes wait_for_animation_to_end=True

All other callers (long_press, stop_app, open_link, back, press_key) rely on the default behavior without animation waiting

The Maestro documentation recommends avoiding fixed waits and reserving waitForAnimationToEnd for specific cases, making the conditional approach in launch_app appropriate for that operation. The 500ms timeout is reasonable within Maestro's design model.

Likely an incorrect or invalid review comment.

minitap/mobile_use/controllers/mobile_command_controller.py

cguiguet

ok for me, but the rabbit comment is relevant

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

minitap/mobile_use/tools/mobile/wait_for_delay.py (1)
20-20: Add return type annotation to match ToolWrapper expectation.

ToolWrapper.tool_fn_getter expects Callable[[MobileUseContext], BaseTool]. Annotate and import BaseTool.
-from langchain_core.tools import tool
+from langchain_core.tools import tool, BaseTool
@@
-def get_wait_for_delay_tool(ctx: MobileUseContext):
+def get_wait_for_delay_tool(ctx: MobileUseContext) -> BaseTool:

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 9289445 and c8b858e.

📒 Files selected for processing (1)

minitap/mobile_use/tools/mobile/wait_for_delay.py (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (1)

minitap/mobile_use/tools/mobile/wait_for_delay.py (4)

minitap/mobile_use/context.py (1)

MobileUseContext (57-74)

minitap/mobile_use/controllers/mobile_command_controller.py (1)

wait_for_delay (558-561)

minitap/mobile_use/graph/state.py (2)

State (20-88)

asanitize_update (64-88)

minitap/mobile_use/tools/tool_wrapper.py (1)

ToolWrapper (9-12)

🔇 Additional comments (2)

minitap/mobile_use/tools/mobile/wait_for_delay.py (2)

61-69: ****

The review comment is based on a misunderstanding of the AgentNode type. AgentNode is defined as Literal["planner", "orchestrator", "cortex", "screen_analyzer", "executor"], not an enum class with attributes like AgentNode.executor.

Since AgentNode is a Literal type of strings, passing agent="executor" is correct and fully type-compatible with the function signature agent: AgentNode | None = None. The code in wait_for_delay.py at lines 61-69 follows the same pattern as all other callers in the codebase and requires no changes.

Likely an incorrect or invalid review comment.

54-59: Status parameter is supported in langchain-core 0.3.75—review comment is incorrect.

The project requires langchain-core>=0.3.75, which supports the status parameter on ToolMessage. The current code in wait_for_delay.py (lines 54–59) correctly uses status="error" if has_failed else "success", and this is consistent with other files in the codebase (e.g., launch_app.py, tool_node.py). Moving status to additional_kwargs would contradict the supported API and is unnecessary.

Likely an incorrect or invalid review comment.

minitap/mobile_use/tools/mobile/wait_for_delay.py

…keys

cguiguet

LGTM !

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

minitap/mobile_use/tools/mobile/wait_for_delay.py (1)
44-47: Document or reconsider the negative-value default.

Negative time values are silently coerced to 1000ms, which is arbitrary and undocumented in the docstring. Consider either:

Raising a ValueError for negative inputs to catch caller bugs, or

Defaulting to 0ms instead of 1000ms for more predictable behavior

Update the docstring to document whichever behavior you choose.

Example if raising an error:
-        if time_in_ms < 0:
-            time_in_ms = 1000
+        if time_in_ms < 0:
+            raise ValueError(f"time_in_ms must be non-negative, got {time_in_ms}")
Or if defaulting to 0:
-        if time_in_ms < 0:
-            time_in_ms = 1000
+        if time_in_ms < 0:
+            time_in_ms = 0

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c8b858e and d191029.

📒 Files selected for processing (2)

minitap/mobile_use/config.py (1 hunks)
minitap/mobile_use/tools/mobile/wait_for_delay.py (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (2)

minitap/mobile_use/config.py (1)

minitap/mobile_use/utils/logger.py (1)

warning (85-86)

minitap/mobile_use/tools/mobile/wait_for_delay.py (4)

minitap/mobile_use/context.py (1)

MobileUseContext (57-74)

minitap/mobile_use/controllers/mobile_command_controller.py (1)

wait_for_delay (558-561)

minitap/mobile_use/graph/state.py (2)

State (20-88)

asanitize_update (64-88)

minitap/mobile_use/tools/tool_wrapper.py (1)

ToolWrapper (9-12)

🔇 Additional comments (4)

minitap/mobile_use/config.py (1)

246-265: LGTM! Solid validation and diagnostic improvements.

The refactored merge logic correctly validates override keys against the base configuration and provides clear diagnostic messages with full path tracking. The recursive approach properly handles nested dictionaries while ignoring unsupported keys.

minitap/mobile_use/tools/mobile/wait_for_delay.py (3)

1-18: LGTM! Clean imports and reasonable delay cap.

The imports are appropriate for the async tool implementation, and the 60-second maximum delay cap prevents excessive blocking.

48-54: LGTM! Properly offloads blocking operation.

The use of asyncio.to_thread correctly prevents blocking the event loop, addressing the previous review concern. Exception handling is appropriate.

55-84: LGTM! Correct state update and wrapper integration.

The tool correctly:

Constructs agent outcomes using the wrapper's callback functions

Creates a ToolMessage with appropriate status and error details

Updates state through asanitize_update with the executor agent context

Configures the wrapper with matching lambda signatures

plfavreau added 4 commits October 17, 2025 11:42

refactor: replace waitForAnimationToEnd with simpler wait_for_delay f…

3467557

…unction using time.sleep

fix: add default delay of 1000ms when negative time is provided in wa…

c4de9b4

…it_for_delay tool

refactor: extract agent outcome logic and add to agents_thoughts in w…

177f7ab

…ait_for_delay tool

refactor: remove deprecated screenshot handling from mobile agent sta…

9289445

…te and context

plfavreau requested review from KYN4 and cguiguet October 17, 2025 10:00

coderabbitai bot reviewed Oct 17, 2025

View reviewed changes

minitap/mobile_use/controllers/mobile_command_controller.py Show resolved Hide resolved

plfavreau self-assigned this Oct 17, 2025

cguiguet approved these changes Oct 20, 2025

View reviewed changes

feat: add 60 second max delay cap to wait_for_delay tool

c8b858e

coderabbitai bot reviewed Oct 20, 2025

View reviewed changes

minitap/mobile_use/tools/mobile/wait_for_delay.py Outdated Show resolved Hide resolved

plfavreau added 3 commits October 20, 2025 18:41

feat: add async support and error handling to wait_for_delay tool

560250a

feat: add validation and logging for unsupported LLM config override …

7d1b876

…keys

style: remove trailing whitespace in deep_merge_llm_config function

d191029

cguiguet approved these changes Oct 20, 2025

View reviewed changes

coderabbitai bot reviewed Oct 20, 2025

View reviewed changes

cguiguet merged commit ad98ebd into main Oct 20, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Revamp wait for animation to end tool + tiny fixes #92

feat: Revamp wait for animation to end tool + tiny fixes #92

Uh oh!

plfavreau commented Oct 17, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Oct 17, 2025 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

cguiguet left a comment

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

cguiguet left a comment

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: Revamp wait for animation to end tool + tiny fixes #92

feat: Revamp wait for animation to end tool + tiny fixes #92

Uh oh!

Conversation

plfavreau commented Oct 17, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🚀 What's new?

🤔 Type of Change

✅ Checklist

💬 Any questions or comments?

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cguiguet left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cguiguet left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

plfavreau commented Oct 17, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Oct 17, 2025 •

edited

Loading